Picture for Haonan Li

Haonan Li

Multilingual Idioms in Sentences and Conversations Across High-, Medium-, and Low-Resource Languages

Add code
Jun 01, 2026
Viaarxiv icon

Reinforcement Learning with Robust Rubric Rewards

Add code
May 28, 2026
Viaarxiv icon

Visual Preference Optimization with Rubric Rewards

Add code
Apr 14, 2026
Viaarxiv icon

Controllable Reasoning Models Are Private Thinkers

Add code
Feb 27, 2026
Viaarxiv icon

SimuScene: Training and Benchmarking Code Generation to Simulate Physical Scenarios

Add code
Feb 11, 2026
Viaarxiv icon

Neural Theorem Proving for Verification Conditions: A Real-World Benchmark

Add code
Jan 26, 2026
Viaarxiv icon

TorchTraceAP: A New Benchmark Dataset for Detecting Performance Anti-Patterns in Computer Vision Models

Add code
Dec 16, 2025
Figure 1 for TorchTraceAP: A New Benchmark Dataset for Detecting Performance Anti-Patterns in Computer Vision Models
Figure 2 for TorchTraceAP: A New Benchmark Dataset for Detecting Performance Anti-Patterns in Computer Vision Models
Figure 3 for TorchTraceAP: A New Benchmark Dataset for Detecting Performance Anti-Patterns in Computer Vision Models
Figure 4 for TorchTraceAP: A New Benchmark Dataset for Detecting Performance Anti-Patterns in Computer Vision Models
Viaarxiv icon

LLMBisect: Breaking Barriers in Bug Bisection with A Comparative Analysis Pipeline

Add code
Oct 30, 2025
Figure 1 for LLMBisect: Breaking Barriers in Bug Bisection with A Comparative Analysis Pipeline
Figure 2 for LLMBisect: Breaking Barriers in Bug Bisection with A Comparative Analysis Pipeline
Figure 3 for LLMBisect: Breaking Barriers in Bug Bisection with A Comparative Analysis Pipeline
Figure 4 for LLMBisect: Breaking Barriers in Bug Bisection with A Comparative Analysis Pipeline
Viaarxiv icon

K2-Think: A Parameter-Efficient Reasoning System

Add code
Sep 09, 2025
Viaarxiv icon

BALSAM: A Platform for Benchmarking Arabic Large Language Models

Add code
Jul 30, 2025
Figure 1 for BALSAM: A Platform for Benchmarking Arabic Large Language Models
Figure 2 for BALSAM: A Platform for Benchmarking Arabic Large Language Models
Figure 3 for BALSAM: A Platform for Benchmarking Arabic Large Language Models
Figure 4 for BALSAM: A Platform for Benchmarking Arabic Large Language Models
Viaarxiv icon